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Disordered Heteropolymers with Crosslinks - Phase Diagram and Conformational 

Transitions 

Lorin Gutman and Eugene Shakhnovich 
Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138 

We study the phase behavior of random heteropolymers (RHPs) with quenched cross-links, a 
novel polymer class of technological and biological relevance, and show the possible occurrence of 
freezing with few chain conformations sampled. The sensitivity of the frozen phase microstructure to 
the disorder components is elucidated at positive solubility parameter values; at low T's segregated 
microphases form, while at a finite T, a first order conformational transition occurs, and is attributed 
to statistical matching of large microphases bounded by cross-links. The end of the symmetry broken 
regime stabilization by cross-links occurs at a higher T by a second order conformational transition. 
{submitted to Chem. Phys. Lett.) 



Random Heteropolymers (RHP) with some monomers 
bonded by quenched cross-links may be viewed as a 
natural offspring of two structurally remote classes of 
polymers, cross-linked homopolymeric networks 0, 
and linear RHPs Q— Q, both renown for their impact 
on technology and biology. The theoretical analysis of 
cross-linked RHPs is eminently challenging; two frozen 
disorders are carried by the polymer; the first disor- 
der component is the sequence distribution of disparate 
monomers that has been fixed during synthesis: for linear 
RHPs known manifestations of having a fixed sequence 
of segments in polymer properties are mesophase for- 
mation due to spatial grouping of monomers with sim- 
ilar composition Q|— ||], and the reduction of overall 
chain conformations to a small number of predominant 




Analytical [Q- 
131 studies of linear 



folds (frozen phase formation) 
[H and computer simulations ]lO|| — 
RHP chains clarified major questions on protein folding 
possible mechanisms, thermodynamics and provided in- 
sight into the dynamic pathways of folding of proteins 
to their native conformation. The main past study em- 
phasis on crosslinked chains has been on crosslinked ho- 
mopolymers, the earliest proeminent work in the field 
has been by Deam and Edwards 0|; recent advances at- 
tained in the understanding of cross-linked homopolymer 
networks have been discussed and reviewed by Goldbart 
and co-workers ||^ and Panyukov 

A detailed modeling of crosslinked RHPs is challeng- 
ing, while complex phase behavior is expected in com- 
parison with linear RHPs; the quenched sequence dis- 
tribution of segments fixed by the chain connectivity of 
the incipiently linear RHP was shown to preclude 
macrophase separation by formation of microdomains. 
Crosslink formation in linear RHPs, and the nature of 
these crosslinks (quenched or annealed) are expected to 
significantly impact the formation and stability of mi- 
crophases, and their interdomain interfaces as well as the 
overall number of possible accessible chain folds. 

The weaker disorder - the sequence distribution of seg- 
ments reminiscent of linear RHPs, and the strong disor- 



der component - the fixed junction network reminiscent 
of crosslinked homopolymers, both present in our system 
cannot be treated as annealed disorders. This important 
issue is addressed in the present work by the extension of 
spin glass methodology of analysis of systems with one 
quenched disorder pd| ] ; our formalism allows the explicit 
analytic treatment of the quenched nature of both the 
sequence and crosslink distribution. The phase behavior, 
the conformation chain organization in these phases, the 
interrelation of spatial monomer separation by composi- 
tion to the occurrence of phases of few dominant folds, 
and the transition between these phases is studied. 
Theory Development 

Imagine a dilute solution of statistical two-letter het- 
eropolymers. Composition specific crosslinking agents 
are introduced in solution and intrachain composition 
specific junctions form; the A and B segments residing in 
proximal spatial range can form homogeneous crosslinks 
( A-A and B-B type) and heterogeneous ones ( A-B type) . 
Upon completion of the crosslink synthesis, the crosslinks 
are quenched by chemical means and no further crosslink 
reorganization can occur. For example in proteins cross- 
links are quenched by irreversible reactions with iodoac- 
etamide or by acidification jl^ . 

The microscopic interactions for the incipient two- 
letter linear RHP are described by a continuous mi- 
croscopic Hamiltonian representation inspired from Ed- 
wards work on homopolymers p6| . r(n) is the spatial lo- 
cation of the n'th segment, while 9{n) monitors the chem- 
ical composition of the n'th segment. 9{n) = 1 for an A 
segment and 9{n) = — 1 for a B segment. The coarse- 
grained description of the segment composition fluctua- 
tions for statistical RHPs along the chain contour obeys 
a Gaussian process with mean, < 6 >= 2/ — 1, and se- 
quence fluctuations [0- < {5e{n)5e{n')) >^S{n~ 
ri')4/(l — /)/; 1 is the statistical segment length, and f is 
the fraction of A segments. The Hamiltonian is: 
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The first term in eq. |i| represents the nearest neighbor 
interaction and accounts for the chain flexibihty. In the 
second term of eq. ^ /5i(r) represents the microscopic 
density composition components given by: 



/5^(r)= / dn-il + ein))Sir-r{n)) ; 

PB{r) = I dn^{l-e{n))6{r-r{n)) (2) 



Vy, the binary inter-segment interactions, are repre- 



sented by the following matrix: 



V,- 



VAA{r-r') VAB{r-r') 
VsAir-r') VBBir-r') 



(3) 



VAyi(r-r'), Vyi_B(r-r') and yBB(r-r') represent A-A, A- 
B, and B-B segment-segment interactions, respectively. 
Quenched composition specific crosslinks of type A-A, B- 
B, or A-B are described in our theory as instantaneous 
spatial constraints imposed on the partition function of a 
linear RHP. A general composition specific crosslink has 
the form S{ri{n) — rj{n')) where i,j =A,B. 

Fluctuations in the total number of cross-links be- 
tween different disorder realizations of crosslinks are al- 
lowed by a Poisson distribution; this choice has been re- 
cently discussed , , in context of homopoly- 
mer network studies. Thus, a fixed number of A-A, 
A-B and B-B cross-links is linearly parameterized by 
fJ■AA,^J■BB and fiAB , and the probability of fluctuations 
around haAt^-bb and jiAB is given by: 



p[^JL's 



K 



_ [HAa] [M^b] [^J^BB 

' ~ M! J\ K\ 

exp[~{^lAA + fJ-BB + IJ-Ab) 



(4) 



Thus, the partition function of an RHP chain with a fixed 
sequence and constrained by M crosslinks of type A-A, 
K crosslinks of type B-B, and J cross-links of type A-B 
is given by: 



Z[e{n)\^ J J Dr{n)exp{-— J r{nfdn + 



i,j=A,B 
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{ dv{pA{vY) 
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{ dvpB{vY) 



j^ij drpA{r)pB{r)y 



(5) 



In eq. |^ we have expressed the crosslinking constraints by 
microscopic sequence-dependent-composition-densities 
(viz. eq. ||). The spatial r integration and the coeffi- 
cients in front of the constrains accounts for formation 
of all possible crosslinking constraints that are consistent 



with crosslinking from an equilibrium ensemble of com- 
position specific contacts, and reminiscent of the typical 
experimental chemical crosslinking process considered 
here. 

The free energy computation of our problem requires 
to average the logarithm of the partition function, Z given 
in eq. ^, since both the sequence and the crosslinks are 
quenched: 



(6) 



[9] represents one fixed sequence realization while [r^] are 
the spatial coordinates of one fixed crosslink realization. 
Pi ([6*]) is the probability distribution for the synthesis 
of one quenched sequence, while P2([^'], [ri]) is the con- 
ditional probability distribution of one specific realiza- 
tion of composition specific cross-links given a preexisting 
fixed sequence; thus at each given sequence realization, 
crosslinks can form by fixing some composition specific 
intersegment contacts from spontaneously occuring chain 
conformations adopted by the linear RHPs. 

The crosslink average is performed first by introduc- 
ing 3(n-|-l) identical copies of the system; this mathemat- 
ical trick originally introduced by Deam and Edwards 0] 
in their studies of crosslinked homopolymers, is an exact 
and non-perturbative way for fixing intersegment con- 
tacts from spontaneously occuring binary contacts from 
an equilibrium distribution of chain conformations in ho- 
mopolymers [Q: 
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Our definition of this averaging trick is slightly differ- 
ent from Deam and Edwards formulation; in our formu- 
lation the derivative of log(Z) allows in the present prob- 
lem an exact representation of the sequence dependent 
denominator occuring due to the probability normaliza- 
tion, in a nominator form, and sets the ground for an 
exact analytical sequence average later on. 

The crosslink average of the log of the partition func- 
tion given in eq. ^ is performed with the Poisson distri- 
bution given in eq. ^by summation over M, J, L values. 

Next we perform the average over disordered se- 
quence; the log generated in the crosslink average (viz. 
eq. 1^) is averaged now over the sequence by using one 
more time our averaging trick; thus, the free energy is 
now given by: 

^ = (4-) « (^("+''®") »s,,cl (8) 

Interestingly, the sequence average of the log generates 
in turn a new index m of copies. 
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All together, to ® (n + 1) = / is the total number of 
copies of our system. Methods developed in the field of 
spin glasses by Parisi are extended herein, and used 
to obtain an analytical solution. The sequence distribu- 
tion average is carried out here exactly in the usual way 

As in previous RHP studies, cross-linked RHPs in 
compact states are of interest; under these conditions a 
scaling argument shows that it is reasonable to com- 
pute the free energy by a one-step mean-field Parisi p^ ] 
calculation. It was shown that the reduction in the to- 
tal number of chain conformations to a small number of 
dominant folds, a phenomena usually termed in the RHP 
literature as freezing into a few chain conformations, is 
described by one order parameter ^; the polymer coor- 
dinates, (the annealed degrees of freedom) equilibrate in 
response to the quenched disorder realizations regardless 
of the disorder source, sequence, crosslinks or both; thus, 
in the present case one freezing order parameter mea- 
sures the total reduction of chain conformations due to 
energetic and entropic constraints. The physical meaning 
of ^ is as follows: 

^, = 1-J2p^ (9) 

Pi is the probability of the i'th chain conformation. For 
many chain conformations, each conformation has a low 
probability of occurrence, and practically equals one. 
If the chain is collapsed in one conformation, Pi=l, while 
for other chain conformations where j, Pj—0 implying 
that ^0=0. 

In the present case, not like in the linear RHP case, 
^ is an explicit function of the indices, (a, k). A suitable 
one step parameterization of the freezing order parameter 
5, is ^0 = xqx'q; m and n form a tensor space, I = m^n; xq 
and the one step continuous parameterizations ]23| ] 

of the indices a (crosslinks sector), and k (sequence sec- 
tor), respectively. A separate publication is now in print 
wherein all mathematical details and derivations are pre- 
sented [Q. The scenario of = 1, < xo < 1 is de- 
fined as sequence- induced- freezing, the scenario xq = 1, 
< Xq < 1 is defined as crosslink-induced-freezing while 
the parameter regime of < xq < 1, < Xq < 1 is 
defined as sq.-|-cl.-induced- freezing as obviously inferred 
by the parameterization relevant in determination of the 
overall number of folds. These convenient definitions al- 
lows to keep a good track of the numerical results while 
we discuss the physics but should not be taken too liter- 
ally; first, since the true order parameter that monitors 
the occurrence of few dominant folds is while xq and 
Xq are parameterizations only; also, as our calculations 
show, and is inferred by the crosslinking procedure the 
disorder components are strongly correlated, and it is not 
possible to completely separate the component disorder 
manifestations in the conformational organization of the 
crosslinked RHP. 



Using Parisi commutation relations |2^] , the total free 
energy per monomer for an RHP in compact state with 
composition specific and quenched cross-links is: 

F = ^{1-2{xo + x'q) + 
xoXq 

(2x[,-l)(xo-l)(xo-2)2) 
+ '"^^^"r°'"^ -V(M'^,/)(xo-l) 

XqXq 

^" = T^^'"^o.25(2y,,+V.. + y..) ^^0) 

with: 

V(/i's) = 0.5{flAA + ^J-BB + 2^As) 

0.5V(/i's) - 0.25(maa + ^ibbW - Q.2bxFci{ti' s)f 
XF = 0.5(2^45 - Vaa - Vbb) 
XFciitJ.' s) = 0.5(2^AS - ^.AA - ^J-bb) 

(11) 

Note the important consistency requirement that our pa- 
rameterization obeys: for x'q = 1., the free energy in eq. 
[lO| is tantamount the free energy of an RHP with an- 
nealed crosslinks, a problem that has been recently an- 
alyzed I^J]. Next, we compute numerically the stability 
of F with respect to xq and x'q in all parameter regimes. 
While the value of obtained is used to determine the 
occurrence of few dominant folds, the stability of the free 
energy and the value of the parameterizations of xq 
and xq\ provides essential information on the sensitivity 
of the microdomain structure of the frozen phase to the 
quenched sequence distribution and the fixed crosslink 
realizations. Let us now explicitly explore the conforma- 
tional organization of RHPs in the globular phase. The 
scenario depicted in fig. 1 corresponds to having a small 
number of cross- links of type A-B, and a positive Xf (seg- 
ments are encouraged to group with alike). The physics 
displayed by fig. 1 with regard to the conformational or- 
ganization of the RHPs is qualitatively illustrated in fig. 
2. At low T's, few dominant RHP folds occur due to 
formation of energetically driven microphases with seg- 
regated domain structure and having sharp interfaces. 
The heterogeneous cross-links formation, following the 
crosslinking procedure, nucleates at the A-B interfaces of 
the segregated microphases, and reduces the microphase 
interfacial free energy since each crosslink is an entropic 
constraint; thus chain reorganizations in this regime is 
most sensitive to the crosslink disorder component. The 
spatial microphase organization within the low tempera- 
ture frozen phase is qualitatively represented in fig. 2; a 
heterogeneous cross-link is marked by two adjacent small 
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Fig. 2; Qualitative illustration of crosslink location among 
segregation patterns emerging within the frozen phase. Red 
and yellow regions depict microphases of A and B segments 
respectively, while the color intensity denotes the segregation 
strength. Two small adjacent circles represent two crosslinked 
monomers; red small circle represents a monomer of type A, 
while a yellow small circle represents a monomer of type B. 

In the language of critical phenomena this would im- 
ply the occurrence of a coexistence regions between two 
frozen-globular RHP phases with an equal number of 
folds in each phase dominated in one phase by the se- 



quence disorder and in the other phase by the cross-links 
disorder component. In the present system, is the only 
physically meaningful order parameter that measures the 
reduction in chain conformations, and thus monitors the 
occurrence of freezing; Xq and Xq' parameterize ^o- As a 
result, the coexistence of separate individual such phases 
is not possible. Our numerical calculation confirms this 
fact, and the "coexistence" point does not occur; at Ti, 
a new first order conformational transition occurs to a 
more stable phase wherein the overall number of few 
dominant folds is abruptly reduced, as marked by the 
spike in (viz. red line in fig. 1); our numerical calcu- 
lations show that this new globular phase is character- 
ized by the occurrence of symmetry breaking induced by 
both crosslinks and sequence, thus we name it as mixed- 
frozen-globular phase. Let us now provide a physical in- 
terpretation of this conformational transition. Below Ti , 
the conformational organization of the RHP is primar- 
ily determined by the inter-segment interaction strength. 
In the vicinity of Ti the existence of some cross-links in- 
duces formation of large domains bounded by cross- links. 
At the onset of formation of the mixed-frozen-globular 
state at Ti, the microphases are large in size and have 
diffuse interfaces (viz. fig. 2 - the seq. + cl. induced 
freezing). Since the crosslinks are heterogeneous, most 
likely they will form at this diffuse interfaces, decreas- 
ing the interface flexibility and entropy. But the penalty 
due to local deformation of large cross-linked bounded 
domains is small, and a statistical pattern matching of 
microphases belonging to separate cross-linked bounded 
domains occurs. The frozen globule occurs here due to 
both sequence and crosslink disorder components; only a 
few chain conformations allow prefered pattern matching 
of microphases over length scales as large as the size of 
cross-linked bounded domains, a realization which may 
explain the sharpness of the spike in fig. 1. The coop- 
erativity observed here reminds to a large extent of the 
cooperativity observed in the folding of disulfide bonded 
proteins to their native state ||l^ . In the folding scenario 
of crosslinked proteins, crosslinks are also composition 
specific but homogeneous, they nucleate within the seg- 
regated domains. The apparently negative heat capacity 
inferred in fig. 2 is not inconsistent with thermodynam- 
ics; fig. 1 displays the temperature dependence of dif- 
ferent thermodynamic systems and not phase behavior 
variation within the same thermodynamic system. This 
fact is a result of the experimental crosslinking proce- 
dure. At each temperature within equilibrated globular 
phases of linear RHPs, quenched crosslinks are formed. 
Thus at each T the system constraints (here the quenched 
crosslinks) are different which implies that the phase be- 
havior comparison in fig. 1 is indeed between different 
thermodynamic systems at different temperatures. The 
complementing future study scenario, is the phase behav- 
ior of a linear RHP system crosslinked at one temper- 
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ature, and subject to temperature variations while the 
crosshnk reahzation of the initial crosslinking tempera- 
ture is retained. 

Let us now return to the phase behavior analysis of 
fig. 1. At T2 (viz. fig. 1), our numerical calculation 
shows that the mixed-frozen-globule becomes unstable, 
while the sequence-frozen-globular phase is stable and it 
should be observed as it has the lowest chemical potential 
(viz. fig. 1 green line). The occurrence of this contin- 
uous conformational transition is expected; the number 
of cross-links is small while the reduction in the number 
of dominant folds due to cross-links only cannot occur 
at high temperatures. The T > T2 frozen regime should 
be characterized by weaker and diffuse microphases (viz. 
fig. 2 - sequence induced freezing). A good candidate 
for testing our predictions is the ensemble growth Monte 
Carlo method This approach has been successfully 
implemented in the study of linear RHPs in charged dis- 
order |25|, and in confined geometries, and allowed a 
faithful comparison with prior analytical calculations on 
RHPs [|, §. 
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